Scene Segmentation of 3D Kinect Images with Recursive Neural Networks

نویسندگان

  • Charles Chen
  • Jack Chen
  • Alex Ryan
چکیده

In this project, we study scene segmentation of images from the Microsoft Kinect using deep learning techniques. The Kinect gives a depth map of the scene in addition to a standard RGB image, so we are extending methods for scene segmentation and object recognition developed by Socher’s group which were previously applied to two-dimensional images. Socher’s algorithm parses scenes using recursive neural networks (RNNs) to discover recursive structure in the image.1 With the availability of a large standardized data set gathered from the Kinect by researchers at NYU,2 a natural step is to extend Sochers code to apply the RNN-based algorithm to the three-dimensional Kinect images. Our algorithm starts by oversegmenting the image based on RGB data with the Edge Detection and Image Segmentation (EDISON) system. We compute vision features in these superpixels and map these into a neural network which outputs a “semantic” feature representation for the superpixels. The vision features that we use involve both standard two-dimensional image features as well as features based on the depth information gathered from the Kinect, including measures of the distance and normal vector. The recursive neural network then computes a score indicating whether pairs of adjacent segments should be identified as part of a larger

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Automated MR Image Segmentation System Using Multi-layer Perceptron Neural Network

Background: Brain tissue segmentation for delineation of 3D anatomical structures from magnetic resonance (MR) images can be used for neuro-degenerative disorders, characterizing morphological differences between subjects based on volumetric analysis of gray matter (GM), white matter (WM) and cerebrospinal fluid (CSF), but only if the obtained segmentation results are correct. Due to image arti...

متن کامل

A hierarchical Convolutional Neural Network for Segmentation of Stroke Lesion in 3D Brain MRI

Introduction: Brain tumors such as glioma are among the most aggressive lesions, which result in a very short life expectancy in patients. Image segmentation is highly essential in medical image analysis with applications, particularly in clinical practices to treat brain tumors. Accurate segmentation of magnetic resonance data is crucial for diagnostic purposes, planning surgical treatments, a...

متن کامل

Parsing Natural Scenes and Natural Language with Recursive Neural Networks

Recursive structure is commonly found in the inputs of different modalities such as natural scene images or natural language sentences. Discovering this recursive structure helps us to not only identify the units that an image or sentence contains but also how they interact to form a whole. We introduce a max-margin structure prediction architecture based on recursive neural networks that can s...

متن کامل

A hierarchical Convolutional Neural Network for Segmentation of Stroke Lesion in 3D Brain MRI

Introduction: Brain tumors such as glioma are among the most aggressive lesions, which result in a very short life expectancy in patients. Image segmentation is highly essential in medical image analysis with applications, particularly in clinical practices to treat brain tumors. Accurate segmentation of magnetic resonance data is crucial for diagnostic purposes, planning surgical treatments, a...

متن کامل

A multi-scale convolutional neural network for automatic cloud and cloud shadow detection from Gaofen-1 images

The reconstruction of the information contaminated by cloud and cloud shadow is an important step in pre-processing of high-resolution satellite images. The cloud and cloud shadow automatic segmentation could be the first step in the process of reconstructing the information contaminated by cloud and cloud shadow. This stage is a remarkable challenge due to the relatively inefficient performanc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011